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Abstract 

This paper builds upon our recent work which computed the moment generating function of the 
MIMO mutual information exactly in terms of a Painleve V differential equation. By exploiting this 
key analytical tool, we provide an in-depth characterization of the mutual information distribution for 
sufficiently large (but finite) antenna numbers. In particular, we derive systematic closed-form expansions 
for the high order cumulants. These results yield considerable new insight, such as providing a technical 
explanation as to why the well known Gaussian approximation is quite robust to large SNR for the case 
of unequal antenna arrays, whilst it deviates strongly for equal antenna arrays. In addition, by drawing 
upon our high order cumulant expansions, we employ the Edgeworth expansion technique to propose 
a refined Gaussian approximation which is shown to give a very accurate closed-form characterization 
of the mutual information distribution, both around the mean and for moderate deviations into the tails 
(where the Gaussian approximation fails remarkably). For stronger deviations where the Edgeworth 
expansion becomes unwieldy, we employ the saddle point method and asymptotic integration tools 
to establish new analytical characterizations which are shown to be very simple and accurate. Based 
on these results we also recover key well established properties of the tail distribution, including the 
diversity-multiplexing-tradeoff. 
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I. Introduction 

Multiple-input multiple-output (MIMO) technologies form a key component of emerging broad- 
band wireless communication systems due to their ability to provide substantial capacity growth 
over power and bandwidth constrained channels. Such technologies have received huge atten- 
tion for over a decade now, with recent trends focusing mainly on incorporating MIMO into 
complicated system configurations; e.g., those employing relaying [1-5], cooperative multi-cell 
processing [6-8], information-theoretically secure systems [9-11], and ad-hoc networking [12- 
15]. However, despite the huge progress, some fundamental questions regarding the information- 
theoretic limits of MIMO systems still remain unclear, even for the simplest point-to-point 
communication scenarios. Of these, one the most important is the characterization of the outage 
capacity, which gives an achievable rate for transmission over quasi-static channels. 

Characterizing the outage capacity is much more difficult than the ergodic capacity, since 
it requires solving for the entire distribution of the channel mutual information, rather than 
simply the mean, and thus far this distribution is only partially understood. For example, in [16], 
assuming independent and identically distributed (IID) Rayleigh fading with perfect channel 
state information (CSI) at the receiver, the mutual information distribution was characterized via 
an exact expression for the moment generating function (MGF). This result was given in terms 
of a determinant of a certain Hankel matrix which yields little insight and becomes unwieldy 
when the number of antennas are not small. A similar determinant representation for the MGF 
was adopted in [17] to provide a saddle point approximation for the cumulative distribution 
function (CDF), however the solution was complicated and once again revealed little insight. An 
alternative result was presented in [18] which considered the MGF of the mutual information at 
high signal-to-noise ratios (SNR) and used this to establish a Chemoff bound on the CDF. Whilst 
this approach avoided dealing with complicated determinants, simulations demonstrated that the 
bound was not particularly tight, particularly when considering the CDF region representing 
outage probabilities of practical interest. As an alternative method to characterizing the mutual 
information distribution through its MGF, [19,20] took the more direct approach of using 
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classical transformation theory to derive exact expressions for the probability density function 
(PDF) and CDF for MIMO systems with small numbers of antennas. It was shown however, that 
even for the simplest MIMO configuration with dual antennas (i.e., having two transmit or two 
receive antennas), closed-form solutions were not forthcoming and one must rely on numerically 
evaluating complicated integrals. 

To overcome the complexities of finite antenna characterizations, another major line of work 
has focused on giving a large-antenna asymptotic analysis, which provides more intuitive results; 
see e.g., [21-24]. In such analyzes, the most well known conclusion is that the mutual information 
distribution approaches a Gaussian as the number of antennas become sufficiently large. Different 
approaches have also been employed to derive closed-form expressions for the asymptotic mean 
and variance [21-26]. Quoting [24] as an example, for a MIMO system subjected to IID Rayleigh 
fading with perfect receiver CSI, with Ut transmit antennas, Ur receive antennas, and SNR P, 
if rit and rir are both sufficiently large then the mutual information distribution is approximated 
by a Gaussian with mean /iq and variance ctq given as 



/io = n 



a + b^ / y/fj + aP + y/fj + bP 



(/3 + bP) + VM/S + \ (V/3 + a^-v/^T6P)' 



2 In 
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2 \ + 



2 \^^aP 
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(2) 



where 



n := min{nt, n^}, m := max{nt, Ur}, /3 := m/n, a := ^a/ /? + 1 — 1 j , b := ^\/ /? + 1 + 1 

The Gaussian approximation has been considered extensively due to its relative simplicity 
compared to the exact characterizations. However, in practice, the number of antennas in MIMO 
systems is typically not huge, and it turns out that the Gaussian approximation may sometimes 
be very inaccurate. These deviations have been reported in several previous contributions [17, 
24,27], and here we give two concrete examples to demonstrate them. 
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As a first example, as shown in Figs. l(a)-l(b), the PDF of the Gaussian approximation 
deviates significantly from the true (simulated) PDF of the mutual information for finite antenna 
arrays when the SNR becomes large. This phenomena was emphasized and investigated in 
our recent work [24] for the specific case of equal-antenna arrays (i.e., rit = Ur) by looking 
at the cumulants of the mutual information. Quite interestingly, the figures indicate that the 
deviation from Gaussian is significantly stronger when = Ut compared with the alternative 
case, despite the fact that there are more antennas. Thus, when rit ^ n^, it appears that the 
Gaussian approximation is more robust to increasing SNR. This key observation is unexpected, 
and to the best of our knowledge it hitherto lacks any rigorous explanation. 
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Fig. 1. PDF of the mutual information of MIMO Rayleigh fading channels for different antenna configurations and different 
SNRs. 



As a second example, aside from the deviation observed in the bulk of the distribution for 
large SNR, it turns out that the Gaussian approximation is typically very inaccurate in the tail of 
the distribution when n is finite and the SNR is either small or large. This is observed in Figs. 
2(a)-2(b), where in both cases the Gaussian curve fails markedly in tracking the simulations for 
small but practical outage probabilities. This strong deviation from Gaussian was also discussed 
in [27], where a refined approximation was presented based on adopting an intuitive large-n 
Coulomb fluid interpretation from statistical physics, leading to a set of coupled non-linear 
equations requiring numerical computation. As discussed therein, the main utility of the large 
deviations approach is that it allows one to capture the tail behavior in the regime of 0{n) 
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deviations away from the mean, whilst the Gaussian is restricted to capturing deviations which 
are close to 0(1). 
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Fig. 2. CDF of mutual information of MIMO Rayleigh fading ciiannels for small and large SNRs. 

Obtaining a clear and rigorous understanding of the distributional behavior indicated above 
appears difficult with random matrix theoretic tools which are currently well known to infor- 
mation and communication theorists, such as those based on the Stieltjes transform [23] and 
the replica method [21,26]. To address this problem, in our recent work [24] we introduced a 
powerful methodology involving orthogonal polynomials and their so-called "ladder operators" 
which led to a new and convenient exact characterization of the MGF of the mutual information in 
terms of a Painleve differential equation. This valuable representation provides the machinery for 
systematically capturing the finite-n corrections to the asymptotic Gaussian results, and thereby 
investigating deviations from Gaussian and correcting for them. Some related subsequent work 
for interference-limited multi-user MIMO systems was also presented in [28]. However, whilst 
the Painleve MGF representation in [24] laid the platform for further analysis, the main focus of 
the analysis presented therein was restricted to the case rit = rij.. Moreover, even for this case, 
a rigorous treatment of the large-deviations region considered in [27] was not pursued. 

In this paper, we significantly expand upon our existing studies in [24] to provide a much 
broader characterization and understanding of the MIMO mutual information distribution. Start- 
ing with the exact Painleve representation for the mutual information MGF derived in [24], for 
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sufficiently large n (but not assuming that rij = n^), we systematically compute new closed- 
form expansions for the higher order cumulants of the mutual information distribution. These 
series expansions reveal interesting fundamental differences between the two cases, Ut ^ riy. and 
rit = rir. Notably, it is demonstrated that the typical approach of considering only the leading- 
order terms in the large-n series expansions for each cumulant (e.g., (1) and (2) respectively 
for the mean and variance) is relatively stable for the asymmetric case ^ rir compared with 
the symmetric case rit = Ur, when the SNR P becomes large. This is because the correction 
series for each cumulant (i.e., comprising all terms other than the leading n term) is shown to 
converge to a bounded constant as P increases, and therefore becomes quite small relative to the 
leading term when P is large. For the symmetric case, on the other hand, the situation is very 
different — the correction series for each cumulant not only grows with P, but also at a faster 
rate than the leading term; a phenomenon which was discussed at length in [24]. These results 
allow us to provide a technical explanation for the intriguing large-SNR phenomena observed 
in Figs. 1(a) and 1(b). 

In addition to gaining fundamental insight into the behavior of the mutual information distribu- 
tion, we also provide new accurate analytical characterizations for the distribution by employing 
two different approaches, each being useful in their own region of interest. First, we draw 
upon the Edgeworth expansion technique along with our derived cumulant expressions to give 
a simple refined closed-form distribution approximation which is shown to be very accurate 
around the mean and also for certain moderate deviations into the tails. These results generalize 
previous Edgeworth expansions which were derived for the specific case of nt = rir in [24]. It 
is shown that in contrast to the Gaussian approximation, which is only capable of successfully 
characterizing near 0(1) deviations around the mean (or the bulk) as n increases, the Edgeworth 
expansion captures deviations of up to 0{n''), where < e < 1. Here, e is a key parameter 
which, as it increases, allows the approximation to capture the correct distribution further into 
the tails, however it also requires the addition of more cumulants to be included in the Edgeworth 
series, thereby increasing the complexity. In the extreme case, as e — )■ 1, a// cumulants must be 
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included; thus, the Edgeworth technique becomes unwieldy and alternative methods are required. 
In this scenario, in order to capture such "large deviations" into the tail, we exploit the saddle 
point method along with asymptotic integration tools to give further analytical representations. 
Very simple formulas are obtained for the cases of high and low SNR which, taken together, 
are shown to be very accurate over almost the entire range of SNR values. We point out that 
our saddle point results provide, in effect, an alternative characterization to the results proposed 
in [27], which also considered the "large deviations" regime, focusing on 0{n) deviations from 
the mean. Our results, however, are based on a more rigorous footing, stemming from the exact 
Painleve representation for the mutual information MGF, as opposed to intuitive statistical physics 
analogies. Quite surprisingly, they are also found to be simpler. 

As a final step, to further emphasize the utility of our results and methodology, we use our 
analytical framework to extract the well known diversity-multiplexing-tradeoff (DMT) formula 
of Zheng and Tse [29] which relates to the large deviations region in the left tail at large SNR, 
whilst also deriving similar results for the right tail. These results are found to be consistent 
with those obtained via the Coulomb fluid analogy in [27]. 

The rest of the paper is structured as follows. Section II describes the system model under 
consideration and introduces the Painleve representation for the MGF of the mutual information, 
which is the key tool underpinning our analysis. Then, in Section III, we provide a systematic 
derivation of the mutual information cumulants to leading order in n, as well as finite-n correction 
terms, giving closed-form expressions in both cases. Based on these new results. Section IV ana- 
lyzes the n-asymptotic Gaussian approximation at large SNR, revealing fundamental differences 
between the two scenarios, rit = n,. and rit ^ n,.. In Section V, we draw upon our cumulant 
expressions and the Edgeworth expansion technique to provide a refined approximation to the 
mutual information distribution, both around the mean and for moderate deviations into the 
tails. Subsequently, Section VI exploits the saddle point method and asymptotic integration tools 
to characterize the "large deviations" region, extracting key behavior such as the DMT. In the 
Appendix, correspondences are drawn between our analytical framework and the Coulomb fluid 
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large deviations method used in [27]. 

II. System Model 

Considering a point-to-point communication system with rij transmit and rir receive antennas, 
under flat-fading, the linear MIMO channel model takes the form: 



where y G C"'^ and x G C"* denote the received and transmitted vector respectively, whilst 
H G C"""^"' represents the channel matrix, and n„^xi G C"' represents noise. Assuming rich 
scattering, H is modeled as Rayleigh fading, having IID entries hi^ ~ CA/'(0, 1), known to 
the receiver only. The noise is assumed n ~ CA/'(0,I„ ). The input is selected to be the 
ergodic-capacity-achieving input distribution x ~ CA/" ^0, j , where P is the transmitted 
power constraint and also represents the SNR due to the normalized noise. For the model under 
consideration, the mutual information between the channel input and output is [30]: 



According to (4), we can assume rit > nr without loss of generality; otherwise, if rit < nr, we 
only need replace P with (3 P. 

Throughout the paper, we make the well known assumption that the channel exhibits block- 
fading, such that the fading coefficients vary independently from one coding block to another, 
but remain constant for the duration of each block. In this case, the outage probability becomes 
an important performance indicator, and the "capacity-versus-outage" tradeoff comes into play 
[31]. Quantifying this tradeoff requires the entire distribution of the mutual information (4). 
Consequently, a common approach is to investigate the MGF of I(x; y): 



y = Hx + n 



(3) 




(4) 



A1(A):=E[exp(AX(x;y))] 




(5) 
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With this, the cumulant generating function (CGF) can be expressed as a power series about 
A = 0: 

OO y, 

/C(A):=lnA^(A) = ^K,- (6) 

£=1 

where the coefficient is the f-th cumulant of X(x; y). 

To lay the foundation of our analysis, we first quote the following exact representation for 
the MGF (5) (or equivalently, the CGF) in [24]; a key result derived by the authors by drawing 
upon methods from random matrix theory (see e.g., [32-34]). 

Proposition 1: The MGF (5) admits the following compact representation: 

M{\) = exp(^ j^"' ^^d:^ (7) 

where Gn{x) satisfies a version of the Painleve V continuous a-form: 

{xG'lf =n^ (xG; + G"„ (n + m + A) jn - G^f 

- 4 (xG; - Gn + nm) {G'^ + XnG'^) , (8) 

with ' denoting the derivative with respect to (w.r.t.) x. 

Whilst an explicit solution to the Painleve V differential equation does not exist in general, 
we will show in the next subsection that it can be used to great effect to extract deep insight 
into the behavior of the mutual information distribution. In particular, we will use it to provide 
a systematic method for computing closed-form expressions for the leading-order and correction 
terms to the mean, variance, and higher order cumulants, as the numbers of antennas grow 
large. This will allow us to obtain more accurate characterizations than the asymptotic Gaussian 
approximation, which is based on only the leading order terms of the mean and variance. 

in. Systematic Derivation of the Mutual Information Statistics 

In this section, we further elaborate upon our work in [24, Section IV] by making use of 
the Painleve representation (i.e.. Proposition 1) to obtain closed-form expressions for the higher 
order cumulants to leading order in n, as well as first-order correction terms, which apply for 
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arbitrary (3. Computing the mutual information distribution in terms of its cumulants enables 
us to obtain insights into the shape of the distribution, for example, studying its "Gaussianity". 
Furthermore, by invoking an Edgeworth expansion approach, we will evaluate approximations 
for the outage probability in closed-form, avoiding explicit computation of the inverse MGF (i.e., 
an inverse Laplace Transform). A key contribution in this section is the generalization of the 
results in [24, Section IV] beyond the case /3 = 1, and, based on these results, we will see that 
the mutual information distribution exhibits some key differences in the two scenarios, (3 = 1 
and (3>l. 

A. Power Series Method: Valid for Small P 

For a preliminary study, we use the power series method to seek solutions to (8). Noting that 
limp_i.o In A^(A) = 0, we assume that for sufficiently small P, and thus sufficiently large x, that 
Gn{x) admits an expansion of the form: 



for some hi, 62, • • •• Substituting this power series into (8) and solving for the fe^'s by matching 
the coefficients of \/x^ on the left and right-hand sides, we write the first few fc^'s as follows: 



62 = n^(3 (/3 + 1) A - n^f3X^ , 

63 = -n^p [n^p^ + 3n^P + + l) X + 3n^P {/3 + 1) - n^/3X^ , 

64 = [5n3/3 (/3 + 1) + (/? + 1) + 5(3 + l)] A 

- [n^P (6/32 + 17/3 + 6) + 5n^P] X^ + 6n^P (/3 + 1) A=^ - n^/JA^ , 

65 = - [Sn^p + n^p (15/32 + 40/3 + 15) + n^p (/j^ + iQp^ ^ 20/3^ + 10/3 + l)] A 

+ [n^P (10/32 + 55/32 ^ ^ ^ 4Q^3^ ^ ^2 

- [n^P (20/32 + 55/3 + 20) + 15n^p] X^ + lOn^P (/3 + 1) /3A^ - n^pX^ . 




(9) 



k=l 
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Introducing a rearrangement of the power series (9) according to the order of A, we obtain an 
expression akin to the power series representation of the CGF (6): G„(x) = \gi{x) + \^g2{x) + 
^'^9z{x) + ■ ■ • where ge,i= 1, 2, . . . are each a power series in 1/x, which we omit for the sake 
of conciseness. By using (7), the i-th cumulant ki is then computed by 

As a result of this procedure, here we write the first few k/s, showing the leading order and 
first-order correction terms in n: 



^ ='y-^^^ + — w — ^ 

1 /5P^ 16 (/3 + 1) 175/3^ + 456/3 + 175 , \.n(- 
n2 V2/33 /34 ^ 3/35 y' 



_ 1 /2P3 9 (/3 + 1) ^ 24/3^ + 66/3 + 24 
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B. General Method: Valid for Arbitrary P 
From the power series expressions (1 1)-(15), we observe the following large n series structure: 



Note that Cij is independent of n, but depends on P and /3. This structure gives an important 
general understanding of the mutual information distribution in terms of cumulants, and also 
paves the way for our arbitrary P analysis. As seen from (16), as n grows large with other 
parameters fixed, only the leading terms of the mean and variance survive, hence the mutual 
information distribution approaches Gaussian with increasing n. This agrees with previous results, 
see e.g., [16,22,23]. 

In the previous subsection we solved Cj j for small P. Here, we present a more general 
approach to compute these quantities in closed-form for arbitrary P. First, we establish a recursive 
computation for all cumulants to leading order in n (i.e., generating Cefi,i = 1,2,... along 
the leading diagonal in (16)). Then, employing the same machinery, we obtain a recursion for 
systematically computing the first-order correction terms for all cumulants, allowing us to derive 
closed-form expressions for Ce^i,£ = 1, 2, . . . along the second main diagonal in (16). This, in 
tum, allows us to capture improved accuracy for finite n. In addition to computing the terms 
along the diagonals of (16), we also demonstrate how our machinery based on the Painleve MGF 
representation can be employed to systematically compute the horizontal power series expansion 
in (16) for a given cumulant, to arbitrary degree of accuracy. For this, we focus on the series 
computation of the mean by way of example, with the other cumulant series being computable 
in a similar way. 



/i = n Ci^o 




+ ... 



+ ... 



(16) 
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1) Cumulants to Leading Order in n: Recalling the definition of the cumulants in (6) and 
their large-n series structure in (16), we start by making the replacement A n\ and taking n 
large, such that Gn{x) admits 

Gn{x) = n'^Y{x) + O (1) (17) 

where Y{x) = \yi{x) + X^y2{x) + ■ ■ ■ , independent of n. Here, yeix) corresponds to the leading- 
order term (in n) for the i-th cumulant by the following integral: 

Ce,o = £l / ^^dx, £ = 1,2,... (18) 

The remaining challenge is to compute closed-form formulas for y£{x), £ = 1, 2, . . ., in order to 
obtain the corresponding Cifl. Substituting (17) for Gn{x) into (8) (with A n\) and keeping 
only the leading-order terms in n (i.e., 0{n^)), we obtain the following equation involving^ 

Y(x): 



[{x + (3 + \ + l)Y' -Y]^ = 4: [xY' - r + /3) (r' + A) Y' . (19) 

Note that this equation completely characterizes all of the cumulants to leading order in n (i.e., 
based on this, we may compute o, £ = 1,2,... in the structure (16)). This can be done upon 
substituting Y{x) = \yi{x) + \^y2{x) + ■ ■ ■ into (19) and matching the coefficients of A'^ on 
the left and right-hand side. Using this approach, we systematically obtain equations involving 
yfc(x). A; = 1, 2, . . . which are elaborated upon below. 

The case k = \ characterizes the mean of the mutual information. In this case, we obtain the 
non-linear differential equation involving yi{x): 

[x^ + 2{P + l)x + {P- if] {y[f -2{x + P + l) y[y^ - APy[ + = o , (20) 

'Here and henceforth, without causing confusion, we will omit the argument x when presenting differential or difference 
(recursion) equations. 
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which can be explicitly solved to give 



sjx^ + 2{l3 + l)x + {l3-lf - X - (/3 + 1) 



yi{x) = ^ . (21) 

Integrating yi{x) via (18), we find a closed-form expression for Ci^. With this expression, we 
find that nCi o agrees precisely with the n-asymptotic mean of the mutual information (i.e., /io 
in (D). 

The case k = 2 characterizes the variance of the mutual information. In this case, we obtain 
the non-linear differential equation involving yi (x) and y2 (x) : 

{ -2 [x^ + 2 (/3 + 1) X + (/3 - 1)'] y'l + 2 (x + /3 + 1) + 4/3} y', 

+ 2[{x + P + l)y[- yi] y2 + y[ {Ax {y[f + 2 {x - 13 - 1) y[ - 4y[y, - 2yi} = 0. (22) 

Whilst seemingly complicated, quite remarkably, once we substitute yi{x) with (21), the coef- 
ficient of y2{x) vanishes, i.e., 

-2 [x^ + 2 (/3 + 1) X + (/3 - 1)'] y[ + 2{x + f3 + l)yi+4f3 = 0, (23) 

and the differential equation (22) collapses to a simple algebraic equation in y2{x). The solution 
is then easily obtained as: 

= , - i £!±(£±i)£ , (24) 

2\/x= + 2(/!+l)x + (/!-l)^ 2 x^ + 2(/! + l)x + (/3-lf 

Integrating y2{x) via (18), we obtain a closed-form expression for 6*2,0 which is found to agree 
precisely with the approximation for the ra-asymptotic variance of the mutual information (i.e., 
(T^ in (2)). 

The higher order cumulants, beyond the mean and variance, are characterized by the case 
k > 2. Evaluating and investigating these cumulants is important, since it allows one to study 
deviations from Gaussian for finite dimensions. As we will see, these higher order cumulants 
may also be used to "refine" the Gaussian approximation to provide increased accuracy. 
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For k > 2, yk{x) are found to satisfy the following recursion: 

k k 



i=l i=l 



k-1 



i=l 



k—i 

(2a;yjZ/Li_j+i-2?/jyfc-i-j+i) + (x-/3-l) y'k^i-Vk- 

.i=i 



k~2 



-J2yiyk-^-l (25) 



i=l 



with initial conditions yi{x) and y2{x) given in (21) and (24) respectively. In theory, the recursive 
equation (25) enables us to systematically compute the leading order expressions (in n) in closed- 
form for any desired number of higher order cumulants, in sequence. In its current form, however, 
(25) appears very complicated. Fortunately, this expression can be simplified considerably by 
observing that the only term in (25) which involves yk{x) (or its derivative) is: 

{[x' + 2{^ + l)x+{^-lf]y[-{x + P + l)yi-2/3}y',-[{x + ^ + l)y[~yi]yk, (26) 

with all other terms involving the previously computed yi(x),i < k. As for the variance, quite 
remarkably, the term (26) simplifies considerably upon noting that the coefficient of y'^{x) is 
precisely zero, by virtue of (23). This interesting observation indicates that the computation of 
the higher cumulants, for k > 2, only involves solving simple algebraic equations in yk{x), rather 
than non-linear differential equations involving yk{x) and y[{x). More specifically, the recursive 
equation (25) collapses to the following: 

_ Rk-Eti { x+iP-lf] y[yU+^-2 (x + /3 + 1) y.Wm + y^yk-^+l] 

2[{x + P + l)y',-y,] 

(27) 

for k = 3,4, . . ., where Rf, represents the right-hand side of (25), which depends only on the 
previously computed yi{x), i < k. With this result, yk{x) can be easily computed, systematically 
in closed-form, for any value of k. We simply write down 1/3 (x) and 2/4 (x) here by way of 
example: 



1 / X + /3 + 1 - a/x2 + 2(/3 + 1)x + (/3-1)' 
y3{x) = --x{x + l-f3){x-l + f3)\ V I ^ (28) 

2 ^ (x2 + 2(/3 + l)x + (/3-l)' 
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X + (/3 + 1) - (/j - 1)' [Qx^ + 3 (/3 + 1) X - (/3 - 1)'] 
2 (2;2 + 2(/3 + l)x + (/3-l)')'^' 



^ X -3x^-4 {l + l3)x^ + {l3~lf [5a;' + 9 (/j + l)x^ + 2 (/3 + l)'a;-(/3 + l) ^^^^ 
2 (3;2 + 2(/3+l)x+(/3-lf)' 

Integrating via (18), we obtain 6*3,0 and C4,o in (16) in closed form: 

2/3 /32 + 2 (/3 + 1) /3P + (/3 - 1)' P2 

l(/3-l)'[(/32 + l)p3 + 3/3 (/3 + l)p2]+3/32 (;32 + l)P + /33 + 



/3 + 2 (/3 + 1) /3P + (/3 - 1)' P2)=^/2 
/32 + I 18/3^+28/33 (/3+1) P+(/3-l)2 [3/32p2_6/3(/3+l)p3+(/3-l)2p4] 

-'4,0 ' 



(30) 



C4.0 — :r7;7^ — I ^ — : -P"^ 



2/5' (/32 + 2 (/3 + 1) /3P + (/3 - if P2) 

1 



+ 



2/32 (/32 + 2 (/3 + 1) /3P + (/3 - 1)' P2)'/' 



(/32 + 1) /35 + 5/3^ (/3 + 1) (/32 + 1) P 



+ (10/3^ + 10/33 - I6/32 + 10/3 + 10) /33p2 + 10/32 (/3 + 1) (/32 + /3 + l) (/3 - 1)' P^ 



+ /3 (5/3^ + 14/32 + 5) (/3 _ 1)2 p4 + (/3 + 1) (/3 _ 1)6 p5 



(31) 



As expected, expanding (30) and (31) about P = gives exactly the same power series as the 
leading terms in (13) and (14). 

2) Cumulants to Second Order in n: In addition to computing the cumulants to leading order 
in n, it is also of interest to compute the correction terms (in n), to capture deviations and 
achieve higher accuracy at finite n. To this end, similar to before, we consider 

Gn{x) = n^Y{x) + Z{x) + O {n-^) (32) 

where Y(x) is defined as in (17) whilst Z(x) = \zi{x) + X^Z2{x) H — ■ , independent of n. Here, 
Z£{x) corresponds to the first-order correction term for the i-th cumulant (i.e., characterized by 
Ce^i, £ = 1, 2, ... in (16)) by the following integral: 

Ce,i=i\ / ^^dx, £ = 1,2,... (33) 

Substituting (32) into (8) (with A — )■ n\) and setting the second leading-order terms (in n) to 
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be zero, we obtain the simple algebraic equation involving Z{x): 

|2/3A - [A^ + 2(1 + /3 - a;)A + x"" + 2(1 + (3)x + (/3 - 1)^] Y' + {x - \ + (3 + 1)Y 

2 

- 4Y'Y + GxY'^^Z' = (2 {Y'f - {x + f3 + 1 - X)Y' - Y^ Z - ^ (Y"f . (34) 

This equation captures the exact first-order corrections to all leading order cumulant approxi- 
mations. Substituting Y{x) = Xyi{x) + \^y2{x) + ■ ■ ■ and Z{x) = \zi{x) + \^Z2{x) + ■ ■ ■ into 
(34) and matching the coefficients of A'', we are able compute the 2;/t(x)'s systematically. 

The case k = \ corresponds to the correction term for the mean. In this case, we obtain the 
equation involving Zi{x): 

{2/3 - [x" + 2(/3 + l)x +{(3- ifU + (x + /3 + 1)2/1} z[ 

2 

+ [{x + P + l) y[ - 2/1] z, + yyf = . (35) 

Again, once we substitute yi{x) with (21), the coefficient of the z[{x) vanishes and (35) collapses 
to an algebraic equation whose solution is 

(x2 + 2(/3 + l)x + (/3-l)') / 
The case k = 2 corresponds to the correction term for the variance. In this case, we obtain the 
equation involving Z2{x): 

{2/3 - [x^ + 2(/3 + l)x + (/3 - lf]y[ + (x + /3 + l)y^] 4 + [(x + /3 + 1)^; - y^] z^ 

+ {2/3-[x2 + 2(/3 + l)x+(/3-l)>^ + (x + /3 + \)y2^^x{y\f -2{\ + /3 - x)y\-^y\y^-y^) z[ 

+ [(x + /3 + l)y[ - y,] z,+ [{x + /3 + l)y', - 2{y[f - - y[] = . (37) 

The coefficient of Z2{x) is the same as in (35) and vanishes, which means (37) is an algebraic 
equation for Z2{x). More interestingly, after substituting yi{x) with (21) and y2ix) with (22), 
the coefficient of z[{x) also becomes zero, which further simplifies the computation. Thus we 
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obtain Z2{x) as follows: 

[{X + P + l)y[ - y,] z, + [{x + (3 + l)y', - 2{y[)^ - y^ - y[] 



+ 



{x + f3 + l)y[ - yi 
x^ {/3 + l)x^ + 2 {(3^ - 3/3 + 1) X + (/3 + 1) (/3 - 1)' 

2 ^^2^2{(3+l)x+{(3-l)Y^ 

x^ (/3+l)x3 + (3/32-14/3+3)x2 + (3/3-l) (/3-3) x+{/3^ +10/3+1) {/S-iy 



2\4 



(x2+2(/3+l)x+(/3-l)') 



(38) 



For the case A; > 2, we can derive a recursive equation as follows: 

k 

J2 < { + 2(/3 + l)x + {/3- lf]y'k-^+l + {x + f3 + l)yk-^+l} + 2/34 



i=l 



k~l 



k-j 



k-2 



{ y^-i + 2(1 + /3 - x)y'^_i + 4 ^ y'k-i-j+iyj - 6x ^ y'^-i-^+iV'^ \ + Yl ^'iVk-i-i 



1=1 

k 



k-1 



1=1 



[{x+P+^)y'k-i+i-yk-i+i] + Y- 



i=l 



i=l 



k—i 



X V ^ II 

~2 2-^ y^ yk- 



■i+l ■ 



i=l 



(39) 



The only term in (39) which involves Zk{x) (or its derivative) is: 



{-[x^ + 2(/3 + l)x + (/3 - 1)2] y[ + (x + f3 + 1)2/1 + 2/3} z', + [{x + P + l)y[ - y^]zk (40) 

where the coefficient of zl(x) is exactly the same as the one of y[{x) in (26), which has been 
shown to be zero. This observation indicates that we only need to solve an algebraic equation for 
Zk{x). Interestingly, apart from z',^{x), it is found that the coefficients of z-(x), i = 1, 2, . . . , /c — 1 
in (39) are also identically equal to zero. 

Given that this vanishing property holds for any k, eliminating terms involving z^{x),i = 
1,2, ... ,k in (39) immediately leads to the general recursive solution: 



Zk 



{x+/3 + l)y[-yi 



(41) 



With this result, we can compute Zk{x) in closed-form for any value of k. For example, 23(0;) 



Submitted to the IEEE Transactions on Information Theory (submitted June 2012). 



19 



is evaluated as follows: 



Z3{X 



X 



- + 8 (/3 + 1) + (49/3^ - 38/3 + 49) 



2(x2 + 2(/3 + l)x + (/3-lf)' 
6 (/3 + 1) (15/3^ - 38/3 + 15) + (65/3^ - 76/3^ - 42/3^ - 76/3 + 65) 



+ 4(/3 + l) (/32 + 26/3 + 1) (/3 - 1)2 x^ - (17/32 - 46/3 + 17) (/3 - 1)' x - 6 (/3 + 1) (/3 - 1)' 



X 



X 



9(/3 + l)x^-8(5/32-ll/3 + 5) 



2(x2 + 2(/3 + l)x+(/3-lf)"^' 
- 10 (/3 + 1) (5/32 - 13/3 + 5) x^ - (15/3^ + 134/3^ - 330/3^ + 134/3 + 15) 

+ (/3 + 1) (11/32 _ ^QQ^ ^ (/3 _ 1)23, + 6 (/32 + 5/3 + 1) (/3 - 1)' 



X 



X 



(42) 



Integrating ^^(x) via (33), we obtain the first-order correction terms in closed-form. For 
example, Ci^i and C2,i are given as follows: 

/3+1 /33 (/3+l)+3/32 (/3+l)2 P-i-3/3 (/3+1) (/32 + I) p^+((3^ + i) {p-lf 



C 



1,1 



24/3 



24/3 (/32 + 2/3 (/3 + 1) P + (/3 - 1)' P^) 



3/2 



/32+I 4 (/3+1) /3^P^ + (9/32-42/3+9) /32p4+6/3 (/3+1) (/3-l)^ p5 + (/3-l)^P' 



(43) 



2,1 



24/32 



24/32 (/32 + 2/3 (/3 + 1) P + (/3 - 1)' P2)'/' 



12(/32 + 2/3 (/3 + l)P+(/3-l)2p2) 

(/32 + 1) /3^ + 7/36 (/3 + 1) (/32 + 1) P 



+ 7/3^ (/32 + 1) (3/32 + 4/3 + 3) P2 + (/3 + 1) (35/3^ + 62/3^ + 35) /3^P=^ 

+ (35/3^+9/3^+8/33+9/32+35) /33p^+(/3+l) (/3-l)^ (21/3^-14/3=^-2/32-14/3+21) /32p5 



+ 7 (/3 + 1)' (/3 - 1)'^ /3P6 + (/3 + 1) (/3 - 1)» P 



8 d7 



(44) 



The first-order correction terms to the higher cumulants /ts, K4, . . . are omitted here in order 
to keep the presentation concise, though they follow trivially. Again, expanding (43) and (44) 
about P = yields exactly the same power series as the 0{n^^) term in (11) and the 0(n~2) 
term in (44). Note that if we aim to compute further high order correction terms (i.e., Cij, i = 
1,2, ... ,j > 2) for each cumulant, we can invoke the same procedure as what has been used 
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for computing the two leading-order terms, which only takes more algebraic effort. 

3) Large-n Series Computation for a Given Cumulant: In addition to the machinery used 
in the previous subsection for systematically computing the high order cumulants one after the 
other for a given order in n, we can also derive recursions for systematically computing the 
large-n series for a given cumulant. Here we give the computation of the mean as an example, 
with the corresponding computations for the higher cumulants following similarly. More details 
for the specific case rit = n,. can be found in [24]. 

Based on the power series representation of the CGF (in A), substituting 

Gn{x) = Xgi{x) + X^g2{x) + X^gsix) + ■■■ (45) 

into (8) and matching the coefficients of A^, we obtain the equations for gi{x),i = 1,2, .. .. For 
the mean we are interested in gi{x), which takes the form: 

{g'if -n' + 2 (/3 + 1) X + (/3 - 1)^) {g[f 

+ 2n^ [{x + (5 + 1) gi + 2(5n] g[ - n^gl = . (46) 

Making use of the large-n series structure in (16), we assume 

gi{x) = nQo{x) + -Qi{x) + ^Q2{x) + ... (47) 
n n-^ 

where Qi{x),l = 0, 1, . . . are independent of n and are related to the large-n series expansion 
for the mean via the following integral: 

Ci,,= r^'^^dx, 1=1,2,... (48) 

Substituting (47) into (46) and matching the coefficients of n^, equations involving Qi{x),i = 
0, 1, . . . are obtained successively. Considering the highest order in n, we have 

[x^ + 2 {(3 + 1) + {(3- lY] (Q'o)' -2[{x + (3 + l)Q, + 2/3] Q'^ + Ql = 0. (49) 

Note that this equation is exactly the same as (20), because they both characterize the leading 
order term (in n) of the mean and lead to the same solution (21). For A; > 1, we derive the 
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following recursion 

k k-1 

+ 2(x + /3 + 1) ^ Qk-^Q[ + x'J2 QU-iQ'l 

i=0 i=0 

k k 

= Q'^-^Q^ + E + + 1)^ + 1)'] Q'k-^Q'^■ (50) 

i=0 i=0 

Interestingly enough, we find again that in (50), the only term which involves Qk{x) (or its 
derivative) is 

{-2 [x^ + 2(/3 + l)x +{(3- If] g'o + 2(x + /3 + l)Qo + 4/3} Q', + 2 [(x + /3 + l)g'o - Qo] Qfc 

where the coefficient oi Q'^{x),k = 1,2, .. . vanishes identically. Setting k = 1 results in exactly 
the same equation as (37), since they both correspond to Ci,i in (16). For k > 2 we obtain the 
general recursion of Qk{x): 

Qk = 

Y!lZl[Qk~^Q^+{x^+'2{p+l)x+{P-lY) QUQ'^-'2{^+P+^)Qk^^Q[]-^^Y!:zlQU^lQ'l 

2[(a; + /3 + l)g'o-go] 

(51) 

Note that this recursion is a generalization of [24, Eq. (132)] which was derived for (3 = 1. 

Integrating Qk{x) via (48) gives us closed-form expressions for Ci^^, £ = 0,1, We write Ci^2 

here as an example, and omit the presentation of the higher order correction formulas for the 
sake of conciseness: 



C7i.2 = i ^ [ (1 + + 9 (/3' - /3 + 1) (/3 + 1)' /3*P 

240/33 240/33((/3-l)2p2 + 2(/3 + l)P + /32)!'/n^ ' ^ 

+ 18 (/3 + 1) (2/3^ + 3/3 + 2) (Z?" - /3 + l) /3^P' + 42 (2/3^ + /3 + 2) (/3^ - /3 + l) (/3 + 1)" /3'^P^ 

+ 42 (2^^ + ^ + 2) (/3^ - /3 + 1) + 1)^ [3^ P'^ + (126 + 126^** - 384/3'' + 126/3^ + 126/3^) ^"P'^ 

+ 6 (/3 + 1) (14/3** - 35/3^ + 35/3^ - 21/3-'^ - 16/3'* - 21/3^ + 35/3^ - 35/3 + 14) 0^ 

+ 6 (6/3* - 9^^ - 3/3*^ + 9^^ + 56^* + 9/3^ - 3^^ - 9/3 + 6) - 1)^ /32p'' 

+ 9(/3 + 1) (/3'* -3/3' +3/3^ + 3/3^ -3/3 + 1) (/3- l)-*/3P* + (/3^ - 3/3^^ + 3/3'' + 3/3^ - 3/3 + l) (/3 - l)''P''j. (52) 



Systematic derivations of the series expansions for the variance and higher order cumulants 
follow similarly, using the same approach. In this way, one can obtain the large-n series for 
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each cumulant to arbitrary degree of accuracy. 

IV. Large SNR Analysis of the Gaussian Deviation 

The systematic high order cumulant expansions derived in the previous section allow us to 
closely investigate the behavior of the mutual information distribution under various conditions. 
For example, our analysis in Section III-Bl recovered the fact that as the number of antennas 
n grows large, with the SNR P kept fixed, the distribution approaches a Gaussian. However, in 
practice, the number of antennas is finite and is not typically huge (though some recent trends 
have considered such systems [35-39]), whilst the SNR may vary significantly depending on 
the application. Thus, a natural question is, for asymptotically high SNR, if and by how much 
the mutual information distribution deviates from Gaussian. 

In this section, we focus on the high SNR regime of the mutual information. In this case, 
as shown in Figs. l(a)-l(b), the distribution appears to deviate from Gaussian as the SNR 
becomes large, whilst the deviation appears to be much stronger for the case of equal numbers 
of transmit and receive antennas (i.e., n = m) compared with the case n ^ m. This is an 
interesting observation which thus far has resisted theoretical validation or explanation. Here 
we draw insight into this phenomenon by employing our new closed-form high order cumulant 
expansions given in the previous section. 

Interestingly, by taking P large in (1), (2), (30), (43) and (44), it turns out that we obtain very 
different limiting results depending on whether /3 = 1 (i.e., = n^) or not. For /3 = 1, as P 
grows large we obtain^: 

^^-ilnm+^(^r + -- (54) 



2 

2 V 2 / ' 32 \ n 



"Note that the correction term for the third cumulant K-i (i.e., Cs,! in (16)) was obtained in the derivation described in Section 
III-B2, but was not presented explicitly in this paper for the sake of conciseness. 
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These large-n — large-P series expansions were also documented in [24], which focused primarily 
on the case Ut = rir- From these expressions, we see that for each cumulant the correction terms 
grow with P, and indeed grow faster than the leading order terms. As such, approximating each 
cumulant via their leading order term will be inaccurate when P is large. 
Now, considering /3 7^ 1, we obtain: 



11 ^ n 



InP- l)ln 



/3 



1 



+ 



120(/3 



o" ~ In 



/3 



n 



1 



+ 



120/33 
1 



1 

+ - 

n 

1 

TV' 



+ 



1 



Ll2(/3-l) 
1 



12/3 



252(/3 -l)5 252/35 



+ 



12(/3-l)' 12/32 J 



(/3 - 1) /3 



+ 



1 



+ 



(56) 
(57) 
(58) 



.6(/3-l)^ 6/33_ 

From these we can make some key observations, which we summarize in the following remarks: 
Remark 1: In contrast to the results in (53)-(55), for rit ^ Ur, all terms other than the leading 
term of fi are strictly bounded as P increases, converging to constants depending on /3. Moreover, 
as n or /3 increase, the correction terms have less effect, eventually becoming negligible, even 
when P is very large. 

Remark 2: For rit 7^ rir, the n-asymptotic power series representations for the cumulant s 
remain valid for arbitrary SNR P; whilst in contrast, for Ut = rir they break down for sufficiently 
large P. This clearly indicates that the commonly-assumed Gaussian approximation (based on 
the leading order terms of fi and a only) is quite robust at high SNRs for the case nt ^ rir, but 
not for the case Ut = rir. 

These remarks are illustrated in Figs. 3(a)-3(c), showing cr^ and K3. Fig. 3(a) shows that 
when Ut = n^, the leading order term of cr^ ([ q^ representing the variance of the Gaussian 
approximation) and the leading order term of K3 both deviate strongly from simulations when 
P is sufficiently large. In contrast, when Ut 7^ rir (i.e.. Figs. 3(b)-3(c)), the leading order 
cumulants capture the simulations accurately for arbitrary P, even with smaller n. We also see 
that increasing (3 enhances the accuracy of the leading order results, whilst including the first- 
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order correction term provides improved accuracy as well. These observations explain technically 
why the Gaussian approximation, based purely on the leading order terms of fi and a^, is 
relatively more robust to increasing SNRs in Fig. 1(b), compared with the results in Fig. 1(a). 
Nevertheless, even in the former case, the higher cumulants (e.g., of order 0{l/n)) still contribute 
to some deviations from Gaussian. These deviations become particularly significant in the tail 
region, yielding the Gaussian approximation unsuitable for capturing low outage probabilities 
of practical interest. This will be considered further in the subsequent sections, where we will 
make use of our closed-form cumulant expansions to "correct" for these deviations, both in the 
tail and around the mean, and thereby refine the Gaussian approximation. 
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Fig. 3. Comparison of to leading order (in n) and witli first-order correction. Kg to leading order and with first-order 
conection, and Monte Carlo simulations. Results are shown for different antenna configurations. 
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Before presenting these refinements, we would like to briefly connect our large- n-large- 
P cumulant expansions with existing results, which were obtained via different means by 
considering n fixed and taking P large at the beginning. This approach was used to obtain 
the leading-order terms of /i and in [22], and these results coincide exactly with the leading- 
order terms in (53) and (54). In addition, the first few terms of the mean and variance expansions 
(i.e., (56) and (57) respectively) match exactly with the results in [40, Eq. (134)] and [41, Lemma 
A. 2]. For example, [40] has the large-P mean representation: 

/P\ 

/i~n In — — n + mipim) — (m — n)ip{m — n), P — oo (59) 
\m J 

which gives (56) by expanding the digamma function tp{x) at large x as 

ib(x) = lnx-- 3— H 3— ^ + o(-^], X ^ oo. (60) 

' 2x 12x2 120x4 252x6 \x^ J ' ^ ^ 

Our results can also be shown to be consistent with the large-P MGF representation derived in 
[18]. 

V. Characterization with the Edgeworth Expansion 

Armed with closed-form expressions for Cij in (16), in this section we draw upon the 
Edgeworth expansion technique. This approach allows us to start with a Gaussian distribution and 
to systematically correct this by including higher cumulant effects (i.e., other than the mean and 
variance), giving an explicit expression for the corrected PDF. Moreover, the CDF, which directly 
defines the outage probability, can be obtained explicitly through a straightforward integration. 

We first write the MGF in the following form 

Mi\) = exp(f^^AM^^\X) (61) 



where A^('-^^(A) represents the MGF of a Gaussian distribution with mean /i and variance a^. 
Note that the MGF is, in effect, the Laplace transform of a PDF (evaluated along the real axis), 
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thus in the PDF domain (61) is equivalent to 

PX(x;y)(t) = exp J i-dtY^ Pfil,y)it) (62) 

where P^l..y^{t) = 7^*2"^^/^, z := {t — ^)/a and dt := d/dt. Expanding the exponent in (62) 
and collecting terms according to the powers of a, the PDF of the mutual information takes the 
form [42, Eq.43]: 

where 

is the quantity which determines any deviation from Gaussian. Here, L is a positive integer char- 
acterizing how many cumulants are included in the corrected PDF (i.e., k^, £ = 3, 4, . . . , L + 2 are 
involved). Note that the second summation enumerates all sets A; = {/ci, /c2, ■ ■ ■ , ^s} containing 
the non-negative integer solutions of the Diophantine equation ki + 2k2 + ■ ■ ■ + skg = s. For 
each of these solutions, a corresponding constant r is defined as r = ki + k2 + ■ — \- kg. In [42], 
a practical algorithm for computing the {k} solutions is proposed in general. Hei{z) is the i-th 
Chebyshev-Hermite polynomial, with the explicit form [42, Eq. (13)] 

HeAz) = y , ^ , (65) 

where [-J denotes the floor function. These are generated by differentiating the standard normal 
distribution: 

HeH^) = (-l)'^?J'H2)MW (66) 

where g^i^z) = exp (—z"^ /2) / V2Tr and the superscript {£) denotes the i-th derivative w.r.t. z. 
Here we list some specific polynomials that we are going to use subsequently: 

Resiz) = z^ -3z, He4(2) = - + 3, He6(^) = - 15^^ + 45^^ - 15 . (67) 
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In theory, we can approximate the mutual information distribution with arbitrary accuracy, by 
taking L sufficiently large in ^{z). We first set L = 2 (i.e., including ^3 and K4): 



PX(x;y)(^) 



P 



(G) 



a 



1^3 



He, (z) 



■He4 (2) + 



rHe« (z) 



(68) 



and compare the Edgeworth expansion with the Monte Carlo simulations and the Gaussian 
approximation in Fig. 4. Note that for the Edgeworth expansion curves we only use the leading 
order terms of the cumulants (i.e., Cefl/n'^~^ in (16)), since as we have shown, the leading order 
terms of the first few cumulants give valid approximations for arbitrary P if /3 7^ 1 (c.f. Remark 
2). 
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Fig. 4. PDF of mutual information, comparing tlie Gaussian approximation, Edgewortli expansions (witli L = 1 and L = 2), 
and Monte Carlo simulations. Results are shown for 714 = 3, Tir = 2 and for different SNRs. 



To examine the outage probability, we need to derive the CDF of the mutual information. 
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Recalling (66), we have 

/ Eei (u) go {u) du ^ Eei{u) go{u)du = -Eei_i{z)go{z). (69) 



Therefore, the Edgeworth PDF formula (63) can be immediately integrated to give the CDF: 

\ ile<i+2r-l [Z) J 

s=l {k} f=l ^ ' ^ - 

where Q{z) := e~"^/^dM. Comparisons between the CDF curves computed by the 

Edgeworth expansion and the Gaussian approximation are made in Fig. 5. We see that in the 
tail region, the Edgeworth expansions with higher cumulants nicely approach the simulations (in 
this case, L = 6 was required to achieve good accuracy), whilst the Gaussian curve strongly 
deviates. This confirms that, by the virtue of our new cumulant expressions and the Edgeworth 
expansion, we can compute the outage probability of MIMO systems with high accuracy in 
closed-form for arbitrary SNR. 

Moreover, from these numerical tests, for small SNR (i.e., SNR = — 10dB,0dB), we see 
that the Gaussian approximation overestimates the outage probability in the tail, whilst for large 
SNR, the outage probability is underestimated. These tail deviations can be analyzed as follows. 
For points far away from the mean (i.e., large z), we approximate 

He, (z) ~ (71) 

and the Edgeworth expansion (64) becomes 



s=l {k} 



For large z, the highest order of (z/aY^'^'^ will dominate the lower order terms. Since r is 
maximized for r = s with {/ci, k2, - ■ ■ , ks} = {s, 0, ■ ■ ■ , 0}, letting L — )■ oo, V{z) can be further 
estimated as 



s=l 
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Fig. 5. CDF of mutual information, comparing the Gaussian approximation, Edgeworth expansions (with L = 4 and L = 6), 
and Monte Carlo simulations. Results are shown for rit — 6,nr = 3 and for different SNRs. 



(- 



1^3 



With this, the PDF of the mutual information becomes: 



(73) 



PX(x;y)(^) 



e 2ct2 ' 6<T 



2Txa 



(74) 



which gives a more accurate distribution than the Gaussian in the tails for finite n, since the 
effect of ^3 is considered. The same formula was presented in [24], which focused on the case 
Ut = rir. Here, we provide more general insights with our new cumulant expressions which 
apply for both rit = rir and nt ^ rir. 
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First, (74) indicates that the Gaussian approximation is accurate under the condition: 

. (75) 



\t - n\ « 



3 



6^6 



6al 



3,0 



Thus, the large-n Gaussian approximation is valid for deviations of |t — /i| ~ 0{n^^^) or less 
from the mean. Outside this regime, the second term of the exponent in (74) becomes non- 
negligible. In this case, by examining the factor n^/a^ in (74), we can further understand how 
the distribution behaves compared with the Gaussian approximation. To this end, considering 
the variance and third cumulant to leading order in n (i.e., (Tq given in (2), and C^^o/n with Cs^ 
given in (30)), for small P, 



f^3 2(3 



P -> , (76) 



whilst for large P, 



_2_ 

In 

cr" I 1 



^ ^ j n {In Pf ' ^ - ^ 

(3 > 1 



P ^ oo . (77) 



n/3(/3-l)(ln(l-/3-i))=* ' 



These results are in perfect agreement with [27, Eq. (70)] obtained via Coulomb fluid arguments, 
and also [24, Eq. (218)] obtained for the special case (3 = 1. Therefore, seen from (74), for small 
SNR, Ks/a^ > 0, and the left tail of the PDF should be always above the Gaussian approximation 
(similarly for the CDF in the left tail). For large SNR and (3 > 1 on the other hand, n^/a^ < 0, 
and the situation is the opposite. 

Note that the interpretation of the large SNR results above for the particular case (3 = 1 
should be taken with caution since, as discussed in Section IV and also in [24], the leading 
order expressions for cr^ and (upon which the arguments are based) become inaccurate in 
that scenario, unless n is also very large. For the case (3^1 however, there is no such problem. 

VI. Refining the Tail Distribution via the Saddle Point Method 

Whilst the Edgeworth expansion technique provides an accurate closed-form characterization 
of the mutual information distribution, it becomes unwieldy when too many cumulants are needed 
for obtaining the desired accuracy. Particularly, in the case when one is interested in the tail 
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region of 0{n) away from the mean (i.e., the "large deviation" region discussed in [27]), we 
need to consider the effect of all cumulants to obtain high accuracy. Therefore, to supplement 
the Edgeworth expansion results, in this section we will draw upon the saddle point method and 
the cumulant results from Section III to further investigate the large deviation scenario. 

A. Saddle Point Method 

Assuming that the MGF (equivalently, the CGF) is known, the PDF of the mutual information 
can be derived through the inversion formula: 

Px(x;y)(t) = ^ y e^^'^^'^dX, (78) 

where /C(A) is the CGF and r is a real number defining the integration path. In most cases, it 
is infeasible to evaluate (78) in closed-form. However, noting that /C(A) — At ~ 0{n) (given 
/i ~ 0{n), thus /C(A) = yuA + ^A^ + . . . 0{n) and t = fi + 0{n)a ~ 0{n)), if n is large, the 
saddle point method in [43] can be used to provide an accurate approximation for this integral. 
This is done by choosing the path of integration to pass through a saddle point A* such that the 
integrand is negligible outside the neighborhood of this point. More specifically, for a given t, 
A* is computed as the real-valued root of 

t = /C'(A) , (79) 
where ' denotes derivative w.r.t. A. Defining the so-called rate function, 

I{t) := tX*{t) - IC{X*{t)) , (80) 
the PDF to leading order in n admits [43] 

g-/(t) 

PX(^-y){t) ^ = . (81) 

By the virtue of the saddle point equation (79), we can examine how the large-n behavior of 
the mutual information varies in different regimes of |t — /i| ~ O{n^),0 < e < 1. Substituting 



Submitted to the IEEE Transactions on Information Tlieory (submitted June 2012). 



32 



the cumulant power series expansion of the CGF (6) into both /(t) and (79) gives 



m = {t- ^)A^(t) - -{x*{t)r - -^{y{t)f + o((A*(t))^), 



(82) 



with A*(t) the solution to the saddle point equation 



t = fl + (T'^X*{t) + Kg 



+ K4 



(83) 



2 



3! 



Based on (82) and (83), we draw the following remarks: 

• In the region |t — /i| ~ 0(1), as n — )• oo, recalling that /i = 0{n), cr^ = 0(1), whilst 

= 0{n^^^) for £ > 3, (83) results in A*(t) ^ (t — ij)/a'^ which yields the Gaussian 
exponent I{t) ~ (t — /i)^/(2cr^) in (82). We see that the higher cumulants (e.g., K3, K4, etc.) 
vanish for large n in (83) and (82). This is the region where, as argued in [27], the central 
limit theorem is valid, and thus the Gaussian approximation is asymptotically accurate. 

• Looking at further (sub-linear) deviations from the mean, in the region |t— /i| ~ 0{rf), e < 1, 
(83) generates the same saddle point A*(t) = (t — as n — t- 00. However, since 
\*{t) ~ 0{rf), in this case some of the terms in (82) involving the higher cumulants 
{ki, £ > 3) do not vanish. More specifically, this includes all cumulants Ki for which 
(£ — 2)/£ < e. For example, for large n, becomes effective (provides a non-negligible 
contribution) for deviations of 0{n^^^) or more, K4 is effective for deviations of 0(n^/^) 
or more, K5 is effective for deviations of 0{n^^^) or more, and so on. This behavior is 
consistent with the discussion in the previous section and in [24]. In these scenarios, our 
Edgeworth expansion results presented in the previous section provide an accurate closed- 
form approximation, by accounting for a fixed number of higher cumulant effects. However, 
when the deviations become stronger (e.g., as e increases towards 1), more and more terms 
are required in the Edgeworth expansion in order to account for the increasing number 
of non-negligible high order cumulants, thereby significantly increasing the computational 
complexity. 

• Looking at even further (linear) deviations from the mean, in the region \t — n\ ~ 0(n), 
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as n — 7- oo, (83) indicates X*(t) ~ 0{n). In this case, all cumulants to leading order in n 
(i.e., an infinite number) contribute in (83) and (82). In addition, we find that /(t) ~ 0(?i^). 
In this "large deviation" region which is sufficiently deep in the tails of the distribution, 
the Gaussian approximation strongly misses the correct behavior, whilst the Edgeworth 
expansion in (70) also becomes intractable. 
Importantly, the above discussions provide a unified picture of the mutual information distribution 
for large-n. Specifically, they show that the well known Gaussian approximation, our Edgeworth 
expansion approximation, and the large deviation results (also considered in [27]) have their own 
region of validity, depending on how far one looks into the tail of the distribution as n increases. 
Whilst the Gaussian approximation and the Edgeworth expansion have been well characterized in 
the previous sections, further work is required in relation to the large deviation region, which is 
now considered. First, we note that a key problem encountered with (81) is how to compute I{t). 
In addition, whilst in principle the CDF of the mutual information can be obtained by integrating 
the PDF expression (81) (i.e., -Fx(x;y)(t) = px{:x.;y){u)du), this is difficult and generally does 
not yield a closed-form expression. We will first address this integration problem, then deal with 
the problem of computing the rate function J(t). 

In order to integrate (81), we can derive an asymptotic expression for large n via Laplace's 
method (see e.g., [44, Chapter 2]). Recall that, as previously described, I{u) ~ O(n^), whilst 
IC"{X*) ~ 0(1). Since —I{u) is an increasing function for n < (decreasing function for u > /i), 
to leading order in n, the integration of (81) is dominated by the region in the neighborhood of 
the upper limit t for u < n (lower limit t for u > yu). Thus we expand I{u) about t as 

I{u) = m + {u- t)I\t) + ^-^I"{t) + 0((n - tf). (84) 

Additionally, in the neighborhood of t, the function }C"{\*{u)) is nearly constant and can be 
approximated by its value at t. Based on these arguments (see [44, Chapter 2] for more details). 



Submitted to the IEEE Transactions on Information Theory (submitted June 2012). 



34 



we obtain 



-du 



_ y/27TK"(X*(t)) 

(85) 



By neglecting the terms in the exponent of order (u — t)^ or higher, we have 

"^^^^^fc^' . (86) 

1 + — ; t> 

If one were to instead neglect the terms of order (m — t)^ or higher in (85), then the (less) 
asymptotic formula proposed in [27, Eq. (63)-(64)] results. However, it is important to point 
out that (86) and [27, Eq. (63)-(64)] each involve the derivatives of I{t) (e.g., /'(t) and /"(t)), 
which are difficult to handle, both analytically and computationally. Thus, it is of interest to 
derive a more manageable representation, which is now pursued. 

Recall that the Gaussian approximation is accurate around the bulk of the distribution (which 
captures the vast majority of the area under the PDF curve), whilst the absolute difference 
between the true PDF and the Gaussian approximation is typically small outside of the bulk 
(since the PDF naturally takes extremely small values in this region). Thus, the Gaussian exponent 
(i.e., (t — /i)^/(2cr^)) should capture the leading effect in I{t). As such, we expand the integrand 
around the mean (equivalently, around X*{t) = 0), giving: 



Fx(..,)(t)^ / ^ ^ e-^-'^»"-^)^)dn = ^ / e-^(")e-^dn (87) 
^ '""^^ ' Jo V27r/C"(0) v^a ' 



where 

:= liu) - . (88) 

Noting that |/(n)| <^ {u — /i)^/(2(T^), we adopt Laplace's expansion method again as follows. 
Since (u — /a"^ is a decreasing function for u < fi (increasing function for u > /i), to leading 
order in n, the integral (87) is dominated by the region in the neighborhood of the upper limit t 
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for u < fi (lower limit t for n > /i). In this neighborhood, the function I{u) can be regarded as 
effectively a constant and can be approximated by its value at t, which leads to the following: 



L ?! 



t < /i 

• (89) 

t > n 



V^o- Jt 

Thus the CDF of the mutual information is represented by the concise formula 

^X(x;y)(t)^ " ^ ^ (90) 

In the next subsection, we will show that this asymptotic CDF captures the distributional behavior 
very accurately. 

Now we address the remaining challenge: computing a manageable expression for I(t), and 
therefore I{t) via (88). Recalling the large-n expansion structure in (16), summing up the leading 
terms of the cumulants (i.e., Cifl/n^~^ ,i = 1,2, . . .) gives us the n-asymptotic CGF. However, 
in general, the complexity of the expressions for the Ci^s makes it difficult to derive a generic 
closed-form formula for this asymptotic CGF. Thus, to make solid analytical progress, we focus 
on the scenarios of small- P and large- P. 

B. Asymptotic CGF at Low SNR 

We first consider the n-asymptotic cumulants for small-P. In this case, (11)-(15) indicate the 
generic cumulant expression 

m ^ (i - ly. mn ( , £=1,2,... (91) 
\m J 

Here ki = n and K2 = a"^. Thus the CGF is obtained as 

^ (A) ~ f: ^ (-X) ' = -mn In (l - ^x) . (92) 

In fact, we can also recover (92) by scaling and solving the exact equation for the n-asymptotic 
CGF in (19), where the derivatives are taken w.r.t. x. To this end, knowing that Y{x) = \yi{x) + 
\^y2{x) + ■ ■ ■ , and yt{x) ~ 0{x'^) (indicated by the small P expansion results in Section III-A), 
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in order to keep the terms to leading order in x of Y[x), we introduce the following variable 
substitution: 

-:=!/. (93) 

X 

With this, equation (19) becomes 

-y(^ + t^ + y^Y' -Y =Ay{yY' + Y-P) (^-l^Y' + y^Y' (94) 

where ' is the derivative w.r.t. y. Note that here we consider the large x (equivalently, small P) 
but finite y scenario. Letting x — oo, we have 

[Y' {y' + y)+YY = %2 {yY' + Y-/3)Y' (95) 

with the solution 

Y{y) = . (96) 

Integrating Y(y), we obtain the large-n-small-P CGF: 

/C(A) ~ ^^dx 

J CO 

/ P \ m 

= -mn\n(l A , A<— (97) 

\ m J P 

in agreement with (92), which was obtained by summing the cumulants. Note that the CGF in 
(97) corresponds to that of a chi-square distribution, indicating that in this low-P scenario: 

X(x;y)^-^x'(2mn). (98) 
2m 

This approximation is in fact quite well known, and it can be readily established by noting that 
lndet(I„ + ^HHt) ^ £tr(HHt) for small P (i.e., obtained by expanding lndet(I„ + ^HH^) 
around P = 0, and keeping the first term), and the obvious fact that tr(HHl^) ~ x^(2mri)/2. 

As depicted in Fig. 6, if P is very small (e.g., P = — 5 dB), then the chi-square approximation 
lines up quite well with the simulations; however beyond this very small regime (e.g., for P = 
5 dB), it is inaccurate. Thus, for greater validity, further refinement beyond the leading-order 
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chi-square approximation is necessary. This requires computing the higher order correction terms 
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Fig. 6. CDF of mutual information for small SNRs, comparing the Gaussian approximation, chi-square approximation, and 
Monte Carlo simulations. Results are shown for nt = 4, Ur — 2. 



(in P), a task which appears difficult via direct expansion of the lndet(I„ + ^HHt) formula, 
as indicated above for the leading order chi-square approximation. To our knowledge, such 
refinement has not been computed thus far. 

To develop a systematic method for solving this refinement problem, we may once again make 
use of our Painleve representation. Noting that x is large whilst the new variable y is finite, we 
assume the following large-x expansion: 

Yiiy) , y2{y) , 



Y{y) = Yo{y) + 



(99) 



X x^ 

Substituting (99) into (19) and matching the coefficient of x^'', we can compute Yk{y) system- 
atically in closed-form. For example, 

(3{(3 + l)y 



Yi{y) 



Integrating Yi{y) through (97), we obtain the CGF with first-order correction term (in P): 



(100) 



^ ^ ^ m J 2 {XP/m-lf 



(101) 
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With this, the saddle point (79) is obtained as 

mnP [-2A*^p2 + (nP^ + mP^ + 4mP)A* - 2m^ + mnP + m^P] 



(102) 



2(A'^P-m)^ 

This expression can be solved in closed-form, with the resulting expression involving a cubic 
equation. Alternatively, one may trivially compute the solution numerically for any given value 
of t. With A* solved for a given t, the value of the rate function I{t) and thus I{t) follow 
immediately according to the definitions (80) and (88) respectively (with /i and in (88) 
approximated via /iq and erg). By invoking the CDF formula (90), we can then compute the saddle 
point approximation for the mutual information distribution. This approximation is illustrated 
in Fig. 7. Compared with the chi-square approximation, it is shown that this refined CDF is 
remarkably accurate, even for moderate P values (e.g., P = 10 dB). As P further increases 
(beyond, for example, P = 15dB), we have found that the saddle point approximation starts to 
miss the correct behavior, and higher order correction terms (equivalently, Y2{y), Y^(y), . . .) are 
needed. These can be systematically computed using the same procedure as before. 
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Fig. 7. CDF of mutual information for small SNRs, comparing the Gaussian approximation, saddle point approximation, and 
Monte Carlo simulations. Results are shown for nt = 2, = 2 and for different SNRs. 



C. Asymptotic CGF at Large SNR 

Now we consider the large- n-large-P scenario. Based on the expressions which have been 
computed for C£fi,i = 1,2, .. . for P — )■ oo (e.g., (56)-(58)), we obtain the generic expression 
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for the leading term of the i-th cumulant (£ > 3): 

1 



I \l-2 



1,2,. 



(103) 



With this, upon summing the CGF series, the asymptotic CGF is computed in closed form: 



/C(A) ^ ^oA + ^A^ + 5: 



^=3 



(m — n) ^ 



Now, from (1) and (2), we have 



/io ~ n In P — (m — In 

m 



m — n 



X 



(104) 
(105) 



(Jq ^ In 



m — n 



m 

P -> 00, 



n 



P CX), 



(106) 
(107) 



giving the following large- n-large-P CGF: 



/C(A) 





3 


nln ( — I 




\m J 





A + 



{m + Xf 



In (m + A) 



(m — + A)' 



?7Z^ {Tfi — Tl]^ 

■ In (m — n + A) In m H In (m — n) 

2 2 



(108) 



valid for X E {n — m, +00). 

Remark 3: Interestingly, although (108) was obtained based on the cumulant expressions which 
are valid only for (3^1 (i.e., since a singularity exists for /3 = 1 in the n-asymptotic cumulants, 
as seen in (56)-(58)), by setting /3 = 1, we have 

P^ 



/C (A) ~ nX In 

l2 



3 _ + A) _ , 

-riA + ^ — In (n + A) 

2 2 ^ ^ 



In A In n . 

2 2 

This will be shown to describe the correct behavior of the mutual information. 
The saddle point (79) can be computed by 



(109) 



t = nlnP - nlnm - n + (m + A*) ln(m + A*) - (m - n + A*) ln(m - n + A*). (110) 
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Whilst a closed-form solution for (110) is intractable, it can be trivially computed numerically 
for any given value of t. 

By invoking (81) and (90), we can compute the distribution (both the PDF and CDF) of the 
mutual information. We should point out that t in (110) is a monotonically increasing function 
of A (i.e., dt{\)/dX > 0), thus for any t > n\n{P/l3) — n, there exists a real root A*. This, 
in tum, implies that the distribution cannot be captured explicitly by (110) if t is sufficiently 
small such that /i — t > mln {(3) — [m — n) ln(/3 — 1) (i.e., when looking sufficiently far into 
the left tail region). Nevertheless, the right-hand side of this inequality is increasing with m 
(fixed n)\ thus as m grows, the valid region of (110) extends further into the left tail, allowing 
smaller outage probabilities to be calculated. In fact, scenarios for which m is reasonably large 
compared with n is quite realistic in many applications; for example, in cellular systems, for 
which the base station may be equipped with a reasonably large number of antennas, whilst the 
number of antennas on the mobile device is more restricted due to limited space constraints. 

Fig. 8 depicts the saddle point approximation of the PDF (81) with I{t) computed from (108) 
and (1 10), comparing with the Gaussian approximation and Monte Carlo simulations. The saddle 
point result is clearly much more accurate than the Gaussian, and lines up almost perfectly with 
the simulations when the SNR is sufficiently large (i.e., at 30 dB). Similar observations are made 
in Fig. 9, which shows the corresponding CDF curves based on (90) and the same I{t). Note 
that for these results we have chosen m = 6, n = 2, where m is comparatively large enough 
such that the validity of (110) extends deep enough into the left tail region to capture outage 
probabilities of interest. 

Fig. 10 depicts the complementary CDF (CCDF) of the mutual information, comparing the 
saddle point approximation based on I{t) computed from (108) and (110), the Gaussian ap- 
proximation, and Monte Carlo simulations. As for the left tail, we see that the saddle point 
approximation becomes extremely accurate when the SNR is sufficiently high, and significantly 
outperforms the Gaussian approximation. In fact, quite surprisingly, even for moderate SNRs 
of 20 dB, the saddle point approximation in the right-hand tail traces the simulated curve very 
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Fig. 8. PDF of mutual information, comparing the Gaussian approximation, saddle point approximation, and Monte Carlo 
simulations. Results are shown for nt = 6, Ur = 2 and for different SNRs. 




Mutual Information (nats/s/Hz) Mutual Information (nats/s/Hz) 

Fig. 9. CDF of mutual information, comparing the Gaussian approximation, saddle point approximation, and Monte Carlo 
simulations. Results are shown for nt = 6, ?ir ~ 2 and for different SNRs. 



closely. 

As done for the small-SNR scenario, for large-SNR we can also evaluate the higher order 
correction terms (in P) in order to draw insight into the accuracy of the leading-order results. 
To this end, it is convenient to first introduce the change of variables x — )■ in the Painleve 
representation of the CGF (7) and the large-n equation (19). The CGF to leading order in n can 
then be written as 

]C{n\)r^-n^ r^^^^^dx, n^oo, (111) 
Jo X 
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Fig. 10. CCDF of mutual information, comparing the Gaussian approximation, saddle point approximation, and Monte Carlo 
simulations. Results are shown for different antenna configurations and different SNRs. 



with y(/3/x) satisfying 



f3 



x^Y' + Y 



4 i-xY' -Y + /3){ jY' - X 



X 



/3 



-Y' 



(112) 



where Y' := dY(/3/x)/dx. By noting that in (108) there exists first-order terms in P which are 
0{\nP) and second-order terms which are 0(1), we assume that the CGF admits the following 
generic large-P expansion: 



]C{nX)=n' ( AlnP + 6o + ^ + ^ 



(113) 
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where the coefficient bo denotes the constant term in (108) and 6j, z = 1, 2, . . . depend on /3 and 
A. Taking the derivative of /C(nA) w.r.t. P, we obtain the following power series expansion: 

d flC{nX)\ 

^-A + ^ + ^ + - (114) 

Substituting (114) into (112) and matching coefficients of the powers series of P on the left and 
right-hand sides, we solve the h's: 

A/3 



bi 
b2 
bs 



/3 + A-1 ' 
/3^A(/3-l)(/3 + A) 

2(/3 + A-l)' ' 
/33A(/3 - l)(/3 + A)(/3 + A + l)(/3 - 1 - A) 

3(/3 + A-l)' 

(115) 



Together with (113), we obtain the large- n-large-P CGF with higher correction terms in P: 

/CA ~/CoA + —±- ^v^; ... ^^^^ P^oo (116) 

m — n + X P 2n[m — n + A)* P^ 

where /Co(A) denotes the leading-order large- n-large-P CGF expression in (108). Based on this 
formula, we draw the following remarks: 

• First, we see that the higher correction terms vanish rapidly as m — n + X increases 
(equivalently, A grows for fixed m and n). Meanwhile, as A decreases and approaches n—m, 
the correction terms become large and eventually invalidate the expansion. This indicates 
that the leading-order large- n-large-P approximation for the right-hand tail (corresponding 
to positive A) is more robust for finite values of P, compared with the approximation for 
the left-hand tail (corresponding to negative A). This is consistent with the results shown in 
Figs. 9 and 10. 

• Interestingly, as we keep computing fe/s, it is found that bi,i > 1 have the common factor 
of m — n. Assuming this is true for all higher correction terms, then for equal antenna 
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arrays (i.e., m = n), we have 

/C(A) ~ /Co(A) + — , n^oo. (117) 

Note that the single correction term n^/P is independent of A, meaning that the saddle 
point equation (110) (after setting m = n) is unaffected by including correction terms for 
finite P. This agrees with the formula in [27, Eq. (53)] and the corresponding argument 
made therein. 

Based on (116), we plot the saddle point approximation with first-order correction term (in P) 
in Fig. 11. The refinement brought by including the correction term is clearly evident. Moreover, 
we find that the right-hand tail is captured accurately for moderate P = 15 dB even without the 
first-order correction, which is in line with the discussion in the first point above. 

D. Important Large SNR Behavior in Left and Right Tails (Including DMT) 

In this section, we develop further our analytical results under the assumption that t = 
qnlnP, P — t- oo, where g > is a fixed constant. This represents the setting where the data rate 
is specified to grow as a non-vanishing fraction (i.e., q) of the mean mutual information for large 
P (i.e., nlnP). This assumption is important in various contexts; for example, in specifying the 
fundamental DMT [29], as well as capturing the scheduling gains in opportunistic multi-user 
downlink transmissions [22]. 

We begin with the scenario q > 1, in which case we are interested in the mutual information 
distribution at the right-hand side of the mean. This scenario is relevant for evaluating the 
performance of scheduling algorithms for which the multi-antenna base-station transmits to the 
multi-antenna user with the best channel (i.e., Xbest = max{Xi,X2,. . .}, with denoting the 
mutual information between the base-station and the A;th user, all of which are assumed to be 
independent and to undertake the same distribution). In this case, the mutual information achieved 
by the system will typically be above the mean mutual information for each user /i, and the 
performance gains of such "best user" selection algorithms can be characterized by studying the 
distribution of the per-user mutual information in the regime t > /i. See [22] and [27]. 
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Fig. 11. CDF and CCDF of mutual information, comparing the Gaussian approximation, the saddle point approximation with 
and without the first order corrections (in P), and Monte Carlo simulations. Results are shown for different antenna configurations 
and different SNRs. 



Substituting t = qn\nP,q> 1 into (110), we have the following asymptotic solution: 

X*r^nPi~\ P^oo (118) 



which further yields 



I{t) -ri^P^-MnP'? 



''*e*/% P^oo. (119) 
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Therefore, for high P, the CCDF admits 

Pr(X(x; y) > t) ~ e-i^^*^", t > /i . (120) 

This agrees with a result derived recently in [27, Eq. (79)], by asymptotically solving a set of 
coupled equations obtained via a Coulomb fluid formulation. From (120), we find the probability 
of the mutual information taking greater values than the mean drops very sharply (doubly 
exponentially with t), indicating that for large SNR, the best-user scheduling algorithm indicated 
above will not enhance the overall data rate significantly. 

Now we consider the alternative regime, q < 1, corresponding to the DMT framework 
seminally proposed by [29]. In this case, however, we find that one cannot simply adopt the 
direct approach of substituting t = qnlnP with g < 1 into (110), since a solution for the 
asymptotic A* does not exist. This can be explained by noting that the solution in fact lies in the 
range \* < n ~ m which can not be described by (108), because t = qnhiP < n\n.{P/ (3) — n 
(i.e., the smallest value of t that can be covered by (110)) for q < \ and large P. Nevertheless, 
in the following, we are able to draw upon the exact characterization of the large- n-large-P 
CGF (19) to derive the DMT formula. 

Since in the large deviation regime. A* ~ 0{n) such that all the cumulants to leading order 
in n remain effective for large n, we scale the CGF variable A — j- nA before taking n ^ oo 
in (121). Further, by recalling the definition of the CGF, its leading order representation in P 
should be O(lnP). Consequently, the asymptotic characterization of the CGF admits 

fP Y(B/x) 

]C(nX)^-n^ / U x^ An'^lnP, n ^ oo, P ^ oo (121) 

Jo X 

where Y{(3/x) satisfies (112) and A denotes a certain function of (3 and A. In light of the DMT 
formulation [29], we require the coefficient of the O(n^lnP) term of the CGF, i.e., the quantity 

P^oo V In P Jo X ) 
Note here that we cannot employ the assumption F(/3/x) = Yl^=o^k/x'', since integrating 
this power series diverges for each term. This motivates us to introduce suitable variable trans- 
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formations to (111) and (112) as described below, which are aimed at scaling x to increase with 
P, whilst keeping the new variable of integration finite. To this end, observe that 



InP 



p 



Y{l3/x) 



1 



X 



InP 

-1 



p 



Y{^/x) 



dx 



1 



X 

Y{P/x)<l 



In e/ In P 
1 



InP 
Inx 



Y{(5/x) 



X 



Y{P/x) 



InPj In P Jo 

Y{/3/P') ds , P ^ oo 



dx 



X 



(123) 
(124) 

(125) 



where the new variable s := \nx/\nP and e is an arbitrarily small constant. Here e is introduced 
to avoid the singularity at the lower limit when changing dx/x on the right-hand side of (123) 
to d(lnx) in (124). Nevertheless, for fixed e, the second integral in (124) vanishes as P — )■ oo 
and the lower limit of the first integral becomes zero, thus we arrive to the asymptotic formula 
(125). With (125) we have 



with Y{(3/P') satisfying 



Y' 
hTP 



(3 + \ + l\ P 



InP 



A = lim 

P— >-oo 



Y' + Y 



Y{f3/P') ds 



(126) 



Y' 



InP 



ps 



(3\nP 



Y'-\ 



ps 



(3\nP 



Y' 



(127) 



where Y' := dY{(3/P^)/ds. By comparing the coefficient of P'^/lnP (i.e., the terms corre- 
sponding to the leading order in P) in (127), we have the equation involving Y{/3/P'^): 

ps 



JlnP 

which gives a constant solution 



iw^^') [41^ - 4/3 + (/3 + A + If] = 0, 



(128) 



Y{f3/P' 



{f3 + X + lf 



(129) 



Thus we obtain the large- n-large-P CGF via (125) and (121): 

■(/3 + A + l)2 



/C(nA) ~ 



InP 



n — )■ oo, P — )■ oo . 



(130) 
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By invoking the saddle point equation (79): qnlnP = -^^^IC{n\), we have A* = 2q — (3—l. This 
result reconfirms our statement that the DMT cannot be described by (110), which implicitly 
requires nX* > n — m. With this saddle point, the rate function I{t) (with t = qnln P, q < 1) 
is evaluated as 

I{qn\nP) = qn^ X* \n P - JC{nX*) = n'^ [q^ - q{P + I) + P . (131) 

Consequently, the CDF of the mutual information in the left tail for large P (and large n) 
becomes 

In Pr(X(x; y) < In P) ~ — [qn — m) [qn — n)\nP, x < fi . (132) 
This agrees precisely with the well known DMT result in [29]. 

VIL Conclusion 

Capitalizing upon the exact Painleve based representation for the MGF of the MIMO mutual in- 
formation in [24], we have systematically computed new expansions for the high order cumulants 
of the mutual information distribution which apply for arbitrary SNRs and for asymmetric antenna 
arrays. In particular, closed-form expressions were given for the leading order terms (in n), as 
well as the first-order correction terms which capture finite -antenna deviations. Based on these 
new expressions, we established key novel insights into the behavior of the distribution under 
different conditions; for example, explaining why the n-asymptotic Gaussian approximation is 
more robust to increasing SNRs for asymmetric systems compared with symmetric systems. 
This is an interesting phenomenon which appears difficult to capture with other methods. In 
addition, we called upon the Edgeworth expansion technique along with the high order cumulant 
formulas to provide closed-form refinements to the Gaussian approximation for the tail region 
corresponding to 0{n^) (0 < e < 1) deviations from the mean. For deviations of 0{n), the so- 
called "large deviations" region, the Edgeworth expansion requires summing over all cumulants 
and becomes unwieldy; thus, in this region we employed a saddle point approximation technique 
along with asymptotic integration tools to derive very simple and concise formulas for the 
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CDF for the cases of low and high SNRs. Simulations showed that our results captured the 
tail distribution very accurately for outage probabilities of practical interest. Moreover, whilst 
formally derived based on a large-antenna framework, they were shown to be very accurate even 
when the antenna numbers are small. To emphasize the utility of our framework even further, 
in the end we recovered well known properties of the tail distribution of the MIMO mutual 
information, including the DMT. 

We conclude by noting that the key analytical tool underpinning the analysis in this paper, 
the Painleve based MGF representation in Proposition 1, is extremely valuable since, as we 
have shown, it facilitates a "unified" investigation of the mutual information distribution under 
a wider range of scenarios than appear possible with previous existing tools. To the best of our 
knowledge, together with our recent work [24], this is the first time that such tools have been 
applied to problems in information theory. It turns out that these tools are also applicable to 
other problems in information theory and wireless communications, and such topics are currently 
being pursued. 

Appendix: Relation with Coulomb Fluid Method in [27] 

Here we draw the connections between our saddle point results and those derived based on a 
Coulomb fluid large deviation approximation in [27]. We start by recasting the formulation of 
[27] in terms of the MGF of the mutual information. To this end, consider the exact MGF (5) 
represented in multi-integral form: 

^^^^ ~ /m; nLi y^e-r^'^ U.<M - ^.)^dy ZiO) ^^^^^ 

where y = (yi, . . . , y„) denotes the eigenvalues of HH^^ and 

„ f n n '\ 

Z{X)= / exp<^ A^(l + P2/fc) + ^[(m-n)lnyfc-nyfc]+2^1n|yi-2/,| Idy . (134) 
[ k=i k=i i<j J 
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Based on the Coulomb fluid interpretation (see [45-49] for details), as n grows large, the CGF 
associated with (133) is anticipated to be well approximated with the following: 

/C(A) := InA^(A) ^ -rain F[a{y),\]+ mm F[a{y),0] (135) 

where 

pb pb 

F[a,\]= / a{y){n'[y-il3-l)\ny]}dy-Xn \n{l + Py) a iy)dy 

J a J a 

pb pb 

— I I \ii\y — z\a{y)a{z)(\ydz (136) 



a J a 



is the so-called "free energy", whilst a{y) is a PDF with support [a, h]. Meanwhile, we have the 
saddle point approximation of the density function: 

Px(yc;y){t) ~ exp{- max[tX - /C(A)]} , (137) 

A 

which, combined with (135), is equivalent to 

Px{x:y){t) ~ exp < — maxmin {F[cr, A] + tA} + minF[cr, 0] > . (138) 

Recalling that in the "large deviations" regime of interest, t ~ 0(n),A ~ 0{n) (see the 
discussions in Section VI- A), we scale t ^ nt, X ^ nX, and (138) becomes 

Px{x;y){nt) ~ exp {n^i— maxmin /[a, A, t] + min f[a, 0, 0] > > , (139) 



with 



rr ^ ^ F(nX) 

f[a,X,t] =^^ + tX 



Pb po 

= / cr{y){[y-{P-l)\ny])dy + X t- / + Py)a{y)dy 

pb pb 

- / / \n\y - z\a{y)a{z)dydz . (140) 

J a J a 

We find that (139) coincides exactly with [27, Eq. (23)], where the optimization problems 
max;^mino-(y)(-) are solved jointly, eventually resulting in three coupled non-linear equations 
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in general^. 



In contrast, our method first employs the Painleve equation to obtain the n-asymptotic CGF 

as 



with G{x) exactly characterized by (8). This representation corresponds to evaluating (135) 
explicitly, without any intuitive Coulomb fluid analogy. Then, armed with this CGF result, we 
separately draw upon the saddle point equation to capture the tail distribution, which corresponds 
to solving (137). Thus, in essence, our saddle point approximation is solving the equivalent 
problem to that considered in [27] by explicitly finding the asymptotic CGF (whilst in [27] it 
is implicit). Quite remarkably, under the high and low SNR regimes considered, it also leads to 
simplified results. 



[1] H. Bolcskei, R. U. Nabar, O. Oyman, and A. J. Paulraj, "Capacity scaling laws in MIMO relay networks," IEEE Trans. 

Wireless Cominiw., vol. 5, no. 6, pp. 1433-1444, Jun. 2006. 
[2] Y. Fan and J. Thompson, "MIMO configurations for relay channels: Theory and practice," IEEE Trans. Wireless Commun., 

vol. 6, no. 5, pp. 1774-1786, May 2007. 
[3] J. Wagner, B. Rankov, and A. Wittneben, "Large n analysis of amplify-and-forward MIMO relay channels with coiTelated 

Rayleigh fading," IEEE Trans. Inf. Theor)', vol. 54, no. 12, pp. 5735-5746, Dec. 2008. 
[4] R Dharmawansa, M. R. McKay, and R. K. Mallik, "Analytical performance of amplify-and-forward MIMO relaying with 

orthogonal space-time block codes," IEEE Trans. Inf. Theory, vol. 58, no. 7, pp. 2147-2158, Jul. 2010. 
[5] S. Jin, M. R. McKay, C. Zhong, and K.-K. Wong, "Ergodic capacity analysis of amplify-and-forward MIMO dual-hop 

systems," IEEE Trans. Inf. Theory, vol. 56, no. 5, pp. 2204-2224, May 2010. 
[6] M. Yuksel and E. Erkip, "Multiple-antenna cooperative wireless systems: A diversity-multiplexing tradeoff perspective," 

IEEE Trans. Inf Theory, vol. 53, no. 10, pp. 3371-3393, Oct. 2007. 
[7] D. Gesbert, S. Hanly, H. Huang, S. Shamai (Shitz), O. Simeone, and W. Yu, "Multi-cell MIMO cooperative networks: A 

new look at interference," vol. 28, no. 9, pp. 1380-1408, Dec. 2010. 
[8] H. Dahrouj and W. Yu, "Coordinated beamforming for the multicell multi-antenna wireless system," IEEE Trans. Wireless 

Commun., vol. 9, no. 5, pp. 1748-1759, May 2010. 

''For the special case, nt — rir, there is one non-linear equation for the right-hand tail and two non-linear equations for the 




n ^ OO 



References 



left-hand tail. 



Submitted to the IEEE Transactions on Information Tlieory (submitted June 2012). 



52 



[9] T. Liu and S. Shamai (Shitz), "A note on the secrecy capacity of the multiple-antenna wiretap channel," IEEE Trans. Inf. 

Theory, vol. 55, no. 6, pp. 2547-2553, Jun. 2009. 
[10] A. Mukherjee and A. L. Swindlehurst, "Robust beamforming for security in MIMO wiretap channels with imperfect CSI," 

IEEE Trans. Signal Process., vol. 59, no. 1, pp. 351-361, Jan. 2011. 
[11] F. Oggier and B. Hassibi, "The secrecy capacity of the MIMO wiretap channel," IEEE Trans. Inf. Theory, vol. 57, no. 8, 

pp. 4961-4972, Aug. 2011. 

[12] B. Chen and M. J. Gans, "MIMO communications in ad hoc networks," IEEE Trans. Signal Process., vol. 54, no. 7, pp. 
2773-2783, Jul. 2006. 

[13] A. M. Hunter, J. G. Andrews, and S. Weber, "Transmission capacity of ad hoc networks with spatial diversity," IEEE 

Trans. Wireless Coinmun., vol. 7, no. 12, pp. 5058-5071, Dec. 2008. 
[14] R. H. Y. Louie, M. R. Mckay, and I. B. Collings, "Open-loop spatial multiplexing and diversity communications in ad hoc 

networks," IEEE Trans. Inf. Theory, vol. 57, no. 1, pp. 317-344, Jan. 2011. 
[15] Y. Wu, R. H. Y. Louie, M. R. McKay, and I. B. Collings, "Generalized framework for the analysis of linear MIMO 

transmission schemes in decentralized wireless ad hoc networks," IEEE Trans. Wireless Commun., 2012, to appear. [Online]. 

Available: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6205593. 
[16] Z. Wang and G. B. Giannakis, "Outage mutual information of space-time MIMO channels," IEEE Trans. Inf. Theory, 

vol. 50, pp. 657-662, Apr. 2004. 
[17] H. Shin, M. Z. Win, and J. H. Lee, "Saddlepoint approximation to the outage capacity of MIMO channels," IEEE Trans. 

Wireless Commun., vol. 5, no. 10, pp. 2697-2684, Oct. 2006. 
[18] O. Oyman, R. U. Nabar, H. Bolcskei, and A. J. Paulraj, "Characterizing the statistical properties of mutual information in 

MIMO channels," IEEE Trans. Signal Process., vol. 51, no. 11, pp. 2784-2795, Nov. 2003. 
[19] R J. Smith, L. M. Garth, and S. Loyka, "Exact capacity distribution for MIMO systems with small numbers of antennas," 

IEEE Commun. Eett., vol. 7, no. 10, pp. 481^83, Oct. 2003. 
[20] R J. Smith and L. M. Garth, "Exact capacity distribution for dual MIMO systems in Ricean fading," IEEE Commun. Lett., 

vol. 8, no. 1, pp. 18-20, Jan. 2004. 
[21] A. L. Moustakas, S. H. Simon, and A. M. Sengupta, "MIMO capacity through coiTelated channels in the presence of 

correlated interferers and noise: A (not so) large TV analysis," IEEE Trans. Inf. Theory, vol. 49, no. 10, pp. 2545-2561, 

Oct. 2003. 

[22] B. M. Hochwald, T. L. Marzetta, and V. Tarokh, "Multi-antenna channel hardening and its implications for rate feedback 

and scheduling," IEEE Trans. Inf. Theory, vol. 50, no. 9, pp. 1893-1909, Sep. 2004. 
[23] W. Hachem, O. Khorunzhiy, R Loubaton, J. Najim, and L. Pastur, "A new approach for capacity analysis of large 

dimensional multiantenna channels," IEEE Trans. Inf. Theory, vol. 54, pp. 3987-4004, Sep. 2008. 
[24] Y. Chen and M. R. McKay, "Coulomb fluid, Painleve transcendents and the information theory of MIMO systems," IEEE 

Trans. Inf Theory, vol. 58, no. 7, pp. 4594-4634, Jul. 2012. 
[25] P. B. Rapajic and D. Popescu, "Information capacity of a random signature multiple-input multiple-output channel," IEEE 

Trans. Commun., vol. 48, no. 8, pp. 1245-1248, Aug. 2000. 



Submitted to the IEEE Transactions on Information Theory (submitted June 2012). 



53 



[26] A. L. Moustakas, S. H. Simon, and A. M. Sengupta, "Statistical mechianics of multi-antenna communications: Replicas 

and correlations," Acta Physica Polonica Series B, vol. 36, no. 9, pp. 2719-2732, Sep. 2005. 
[27] P. Kazakopoulos, P. Mertikopoulos, A. L. Moustakas, and G. Caire, "Living at the edge: A large deviations approach to 

the outage MIMO capacity," IEEE Trans. Inf. Theory, vol. 57, no. 4, pp. 1984-2007, Apr. 2011. 
[28] S. Li, Y. Chen, and M. R. McKay, "Mutual information distribution of interference-limited MIMO: A joint Coulomb 

fluid and Painleve based approach," in Proc. of the 4:5th Asilomar Conference on Signals, Systems and Computers, Pacific 

Grove, CA, USA, Nov. 6-Nov. 9, 2011, pp. 774-778. 
[29] L. Zheng and D. N. C. Tse, "Diversity and multiplexing: A fundamental tradeoff in multiple-antenna channels," IEEE 

Trans. Inf Theory, vol. 49, no. 5, pp. 1073-1096, May 2003. 
[30] i. E. Telatar, "Capacity of multi-antenna Gaussian channels," Eiirop. Trans. Telecommun., vol. 10, no. 6, pp. 585-595, 

Nov-Dec. 1999. 

[31] E. Biglieri, J. Proakis, and S. Shamai, "Fading channels: Information-theoretic and communications aspects," IEEE Trans. 

Inf Theory, vol. 44, no. 6, pp. 2619-2692, Oct. 1998. 
[32] A. Magnus, "Painleve-type differential equations for the recurrence coefficients of semi-classical orthogonal polynomials," 

J. Compiit. Appl. Math., vol. 57, no. 1-2, pp. 215-237, 1995. 
[33] Y. Chen and A. R. Its, "Painleve III and a singular linear statistics in Hermitian random matrix ensembles," J. Approx. 

Theory, vol. 162, no. 2, pp. 270-297, 2010. 
[34] E. Basor and Y. Chen, "Perturbed Hankel determinants," J. Phys. A: Math. Gen., vol. 38, no. 47, pp. 10 lOI-IO 106, 2005. 
[35] T. L. Marzetta, "Noncooperative cellular wireless with unlimited numbers of base station antennas," IEEE Trans. Wireless 

Commun., vol. 9, no. II, pp. 3590-3600, Nov. 2010. 
[36] H. Q. Ngo, E. G. Larsson, and T. L. Marzetta, "Analysis of the pilot contamination effect in very large multicell multiuser 

MIMO systems for physical channel models," in Proc. of IEEE International Conference on Acoustics Speech and Signal 

Processing, Prague, Czech Repulic, May 2011, pp. 3464—3467. 
[37] , "Energy and spectral efficiency of very large multiuser MIMO systems," IEEE Trans. Commun., submitted. [Online]. 

Available: http://arxiv.0rg/abs/l 112.3810. 
[38] F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta, O. Edfors, and F. Tufvesson, "Scaling up MIMO: 

Opportunities and challenges with very large arrays," IEEE Signal Processing Mag., 2012, to appear. [Online]. Available: 

http://liu.diva-portal.Org/smash/get/diva2:450781/FULLTEXT01. 
[39] B. Gopalakrishnan and N. Jindal, "An analysis of pilot contamination on multi-user MIMO cellular systems with many 

antennas," in Proc. of IEEE 12th International Workshop in Signal Processing Advances in Wireless communications, San 

Francisco, CA, USA, Jun. 2011, pp. 381-385. 
[40] A. Lozano, A. M. Tulino, and S. Verdu, "High-SNR power offset in multi-antenna communication," IEEE Trans. Inf. 

Theory, vol. 51, no. 12, pp. 4134-4151, Dec. 2005. 
[41] A. Grant, "Rayleigh fading multiple- antenna channels," EURASIP J. Applied Signal Processing, Special Issue on Space- 
Time Coding (Part I), vol. 2002, no. 3, pp. 316-329, Mar. 2002. 



Submitted to the IEEE Transactions on Information Theory (submitted June 2012). 



54 



[42] S. Blinnikov and R. Moessner, "Expansions for nearly Gaussian distributions," Astron. Astrophys. Suppl. Ser., no. 130, pp. 
193-205, May 1998. 

[43] H. E. Daniels, "Saddlepoint approximations in statistics," Aim. Math. Statistics, vol. 25, no. 4, pp. 631-650, 1954. 
[44] I. Avramidi, "Lecture notes on asymptotic expansion," Oct. 2000, [Online]. Available: http://www.nmt.edu/~iavramid/notes/ 
asexp.pdf. 

[45] F. J. Dyson, "Statistical theoiy of energy levels of complex systems I-III," / Math. Phys., vol. 3, no. 1, pp. 140-175, 1962. 
[46] Y. Chen and S. M. Manning, "Asymptotic level spacing of the Laguerre ensemble: A Coulomb fluid approach," J. Phys. 

A.:Math. Gen., vol. 27, no. 11, pp. 3615-3620, 1994. 
[47] Y. Chen and N. D. Lawrence, "On the linear statistics of Hermitian random matrices," Phys. A: Math. Gen., vol. 31, 

no. 4, pp. 1141-1152, 1998. 

[48] Y. Chen and S. M. Manning, "Distribution of linear statistics in random matrix models (metallic conductance fluctuations)," 

J. Phys.: Condens. Matter, vol. 6, no. 16, pp. 3039-3044, 1994. 
[49] Y. Chen and M. E. H. Ismail, "Thermodynamic relations of the Hermitian matrix ensembles," /. Phys. A: Math. Gen., 

vol. 30, no. 19, pp. 6633-6654, 1997. 



